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RAW SEQUENCE LISTING DATE: 10/28/2002 

PATENT APPLICATION: US/0 9/2 5 8 , 6 0 0 TIME: 15:04:48 

Input Set : N:\Crf3\RULE60\09258600.raw 
Output Set: N:\CRF4\10282002\l25 8600.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 



5 (i) APPLICANT: FOWLKES , Dana M. , <~ v 

6 BROACH, Jim 4\< fr** 1 £ ? 

7 MANFRED I , John * ^ " " ^ 

8 KLEIN, Christine 

9 MURPHY , Andrew J. 

10 PAUL , Jeremy 

11 TRUEHEART, Joshua 

13 (ii) TITLE OF INVENTION: YEAST CELLS ENGINEERED TO PRODUCE 

14 PHEROMONE SYSTEM PROTEIN SURROGATES, AND USES THEREFOR 
16 (iii) NUMBER OF SEQUENCES: 119 

18 (iv) CORRESPONDENCE ADDRESS: 

19 (A) ADDRESSEE: BROWDY AND NEIMARK 

20 (B) STREET: 419 Seventh Street, N.W., Suite 300 

21 (C) CITY: Washington 
2 2 (D) STATE: D.C. 

2 3 (E) COUNTRY: USA 

24 (F) ZIP: 20004 

2 6 (v) COMPUTER READABLE FORM: 

27 (A) MEDIUM TYPE: Floppy disk 

28 (B) COMPUTER: IBM PC compatible 

29 (C) OPERATING SYSTEM: PC - DOS/MS - DOS 

30 ( D ) SOFTWARE: Patentln Release #1.0, Version #1.30 
32 (vi) CURRENT APPLICATION DATA: 

C--> 33 (A) APPLICATION NUMBER: US/0 9/2 5 8,600 

C--> 34 (B) FILING DATE: 26-Feb-1999 

35 (C) CLASSIFICATION: 

53 (vii) PRIOR APPLICATION DATA: 

38 (A) APPLICATION NUMBER: US/08/461,598 

39 (B) FILING DATE: 05-JUN-1995 

42 (A) APPLICATION NUMBER: US 08/322,137 

43 (B) FILING DATE: 13-OCT-1994 

46 (A) APPLICATION NUMBER: US 08/309,313 

47 (B) FILING DATE: 20-SEP-1994 

50 (A) APPLICATION NUMBER: US 08/190,328 

51 (B) FILING DATE: 31-JAN-1994 

54 (A) APPLICATION NUMBER: US 08/041,431 

55 (B) FILING DATE: 31-MAR-1993 
57 (viii) ATTORNEY/ AGENT INFORMATION: 

5 8 (A) NAME: COOPER, Iver P. 

59 ( B ) REGISTRATION NUMBER: 28,005 

60 (C) REFERENCE/DOCKET NUMBER: FOLWKES=2F 
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RAW SEQUENCE LISTING DATE: 10/28/200: 

PATENT APPLICATION: US/09/258,600 TIME: 15:04:48 

Input Set : N:\Crf3\RULE60\09258600.raw 
Output Set: N:\CRF4\10282002\l258600.raw 

62 (ix) TELECOMMUNICATION INFORMATION: 

63 (A) TELEPHONE: 202-628-5197 

64 (B) TELEFAX: 202-737-3528 
6 5 (C) TELEX: 24 86 3 3 

68 (2) INFORMATION FOR SEQ ID NO: 1: 

^0 (i) SEQUENCE CHARACTERISTICS: 

"\L (A) LENGTH: 89 amino acids 

n 2 ( B ) TYPE: amino acid 

7.3 (C) STRANDEDNESS : single 

74 (D) TOPOLOGY: linear 

76 (ii) MOLECULE TYPE: peptide 

79 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

81 Met Arg Phe Pro Ser lie Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 

82 1 5 10 15 

84 Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gin 

85 20 25 30 

87 He Pro Ala Glu Ala Val He Gly Tyr Leu Asp Leu Glu Gly Asp Phe 

88 35 40 45 

90 Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 

91 50 55 60 

93 Phe He Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val 

94 65 70 75 80 

96 Ser Leu Asp Lys Arg Glu Ala Glu Ala 

97 85 

99 (2) INFORMATION FOR SEQ ID NO: 2: 

101 (i) SEQUENCE CHARACTERISTICS: 

102 (A) LENGTH: 76 amino acids 

103 (B) TYPE: amino acid 

104 (D) TOPOLOGY: linear 
106 (ii) MOLECULE TYPE: peptide 

109 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

111 Trp His Trp Leu 

112 1 

114 Ala Glu Ala Glu 

115 20 

117 Met Tyr Lys Arg 

118 3 5 

120 Lys Pro Gly Gin 

121 50 

123 His Trp Leu Gin 

124 65 70 75 
126 (2) INFORMATION FOR SEQ ID NO: 3: 

128 (i) SEQUENCE CHARACTERISTICS: 

129 (A) LENGTH: 15 base pairs 

130 (B) TYPE : nucleic acid 

131 (C) STRANDEDNESS: double 

132 (D) TOPOLOGY: linear 
W--> 134 (ii) MOLECULE TYPE: synthetic DNA 

137 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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RAW SEQUENCE LISTING DATE: 10/28/2002 

PATENT APPLICATION: US/09/258,600 TIME: 15:04:48 

Input Set : N:\Crf3\RULE60\09258600.raw 
Output Set: N:\CRF4\10282002\l258600.raw 

139 AAGCTTAAAA GAATG 15 

141 (2) INFORMATION FOR 3EQ ID NO: 4: 
14 3 (i) SEQUENCE CHARACTERISTICS: 

144 (A) LENGTH: 37 base pairs 

145 ( B ) TYPE: nucleic acid 

146 (C) S IRANDEDNESS : single 
14" 7 (D) TOPOLOGY: linear 

14 9 (ii) MOLECULE TYPE: cDNA 

152 (ix) FEATURE: 

153 (A) NAME/KEY: CDS 

154 (B) LOCATION : 1 . . 24 

157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

159 AAA GAA GAA GGG GTA TCT TTG CTT AAGCTCGAGA TCT 37 

160 Lys Glu Glu Gly Va 1 Ser Leu Leu 
1611 5 

164 (2) INFORMATION FOR SEQ ID NO: 5: 

166 (i) SEQUENCE CHARACTERISTICS: 

167 (A) LENGTH: 8 amino acids 

168 ( B ) TYPE: amino acid 

169 (D) TOPOLOGY: linear 
171 (ii) MOLECULE TYPE: peptide 

173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

175 Lys Glu Glu Gly Va L Ser Leu Leu 

176 1 5 

178 (2) INFORMATION FOR SEQ ID NO: 6: 

180 (i) SEQUENCE CHARACTERISTICS: 

181 (A) LENGTH: 77 base pairs 

182 (B) TYPE: nucleic acid 

183 (C) STRANDEDNESS: double 

184 (D) TOPOLOGY: linear 

W--> 186 (ii) MOLECULE TYPE: synthetic DNA 

189 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

191 CGTGAAGCTT AAGCGTGAGG CAGAAGCTNN KNNKNNKNNK NNKNNKNNKN NKNNKNNKNN 60 
193 KNNKNNKTGA TCATCCG 77 
195 (2) INFORMATION FOR SEQ ID NO: 7: 

197 (i) SEQUENCE CHARACTERISTICS: 

198 (A) LENGTH: 19 amino acids 

199 (B) TYPE: amino acid 

200 (D) TOPOLOGY: linear 
202 (ii) MOLECULE TYPE: peptide 

205 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

W--> 207 Lys Arg Glu Ala Glu Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

208 15 10 15 

W--> 210 Xaa Xaa Xaa 

214 (2) INFORMATION FOR SEQ ID NO: 8: 

216 (i) SEQUENCE CHARACTERISTICS: 

217 (A) LENGTH: 36 amino acids 

218 (B) TYPE: amino acid 

219 ( D ) TOPOLOGY: linear 
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RAW SEQUENCE LISTING DATE: 10/28/2002 

PATENT APPLICATION: US/09/258,600 TIME: 15:04:48 

Input Set : N:\Crf3\RULE60\092 5 8 60 0.raw 
Output Set: N:\CRF4\10282002\l258600.raw 



221 (ii) MOLECULE TYPE: peptide 

224 (xi) SEQUENCE DESCRIPTION : SEC ID NO: 8: 

226 Met Gin Pro Ser Thr Ala Thr Ala Ala Pro Lys Glu Lys Thr Ser Ser 

227 15 10 15 

22'^ Glu Lys Lys Asp Asn Tyr lie lie Lys Gly Val Phe Trp Asp Pro Ala 

2 30 2 0 2 5 3 0 

232 Cys Val He Ala 

2 3 3 3 5 

235 (2) INFORMATION FOR SEC I D NO: 9: 

237 (i) SEQUENCE CHARACTERISTICS: 

238 (A) LENGTH: 19 base pairs 

239 (B) TYPE: nucleic acid 

240 (C) STRANDEDNESS : single 

241 (D) TOPOLOGY: linear 

W--> 24 3 (ii) MOLECULE TYPE: synthetic DNA 

24b (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

24 8 AAGCTTTCGA ATAGAAATG 19 

2 50 (2) INFORMATION FOR SEQ ID NO: 10: 

252 (i) SEQUENCE CHARACTERISTICS: 

253 (A) LENGTH: 36 base pairs 

254 (B) TYPE: nucleic acid 

2 55 (C) STRANDEDNESS: double 

2 56 (D) TOPOLOGY: linear 

W--> 25 8 (ii) MOLECULE TYPE: synthetic DNA 

261 (ix) FEATURE: 

262 (A) NAME/KEY: CDS 

263 (B) LOCATION: 1. .27 

266 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

268 GCC GCT CCA AAA GAA AAG ACC TCG AGC TCGCTTAAG 3 6 

269 Ala Ala Pro Lys Glu Lys Thr Ser Ser 

270 1 5 

27 3 (2) INFORMATION FOR SEQ ID NO: 11: 

275 (i) SEQUENCE CHARACTERISTICS: 

276 (A) LENGTH: 9 amino acids 

277 (B) TYPE: amino acid 

278 (D) TOPOLOGY: linear 
280 (ii) MOLECULE TYPE: peptide 

282 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

284 Ala Ala Pro Lys GLu Lys Thr Ser Ser 

285 1 5 

287 (2) INFORMATION FOR SEQ ID NO: 12: 

289 (i) SEQUENCE CHARACTERISTICS: 

290 (A) LENGTH: 79 base pairs 

291 (B) TYPE: nucleic acid 

292 (C) STRANDEDNESS: double 

293 (D) TOPOLOGY: linear 
295 (ii) MOLECULE TYPE: cDNA 

298 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

300 GGTACTCGAG TGAAAAGAAG GACAACNNKN NKNNKNNKNN KNNKNNKNNK NNKNNKNNKT 60 
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RAW SEQUENCE LISTING DATE: 10/28/2002 

PATENT APPLICATION: US/0 9/2 5 8 , 6 0 0 TIME: 15:04 48 

Input Set : N:\Crf3\RULE60\092 58600.raw 
Output Set: N:\CRF4\10282002\l258600.raw 

302 GTGTTATTGC TTAAGTACG 79 

3 OS (2) INFORMATION FOR SEQ ID NO: 13: 

30b (i) SEQUENCE CHARACTERISTICS: 

30 7 (A) LENGTH: 22 amino acids 

308 (B) TYPE: amino acid 
30M (D) TOPOLOGY: linear 

3 11 (ii) MOLECULE TYPE: peptide 

3 14 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

W- - > 3L6 Ser Ser Glu Lys Lys Asp Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

317 15 10 15 

W--> 319 Xaa Cys Val lie Ala 

320 20 

32 3 (2) INFORMATION FOR SEQ ID NO: 14: 

32 5 (i) SEQUENCE CHARACTERISTICS: 

326 (A) LENGTH: 34 base pairs 

327 (B) TYPE: nucleic acid 

328 (C) STRANDEDNESS: single 

329 (D) TOPOLOGY: linear 

W--> 331 (ii) MOLECULE TYPE: synthetic DNA 

3 33 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

3 3 5 GTTAAGAACC ATATACTAGT ATCAAAAATG TCTG 3 4 

3 38 (2) INFORMATION FOR SEQ ID NO: 15: 

340 (i) SEQUENCE CHARACTERISTICS: 

341 (A) LENGTH: 35 base pairs 

342 (B) TYPE: nucleic acid 

343 (C) STRANDEDNESS: single 

344 (D) TOPOLOGY: linear 

W--> 34 6 (ii) MOLECULE TYPE: synthetic DNA 

349 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

3 51 TGATCAAAAT TTACTAGTTT GAAAAAGTAA TTTCG 3 5 

353 (2) INFORMATION FOR SEQ ID NO: 16: 

355 (i) SEQUENCE CHARACTERISTICS: 

3 r ^6 (A) LENGTH: 28 base pairs 

357 (B) TYPE: nucleic acid 

358 (C) STRANDEDNESS: single 

359 (D) TOPOLOGY: linear 

W--> 361 (ii) MOLECULE TYPE: synthetic DNA 

364 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

366 GGCAAAATAC TAGTAAAATT TTCATGTC 2 8 

368 (2) INFORMATION FOR SEQ ID NO : 17: 

370 (i) SEQUENCE CHARACTERISTICS: 

371 (A) LENGTH: 34 base pairs 

372 (B) TYPE: nucleic acid 

373 (C) STRANDEDNESS: single 

3 7 4 (D) TOPOLOGY: linear 

W--> 376 (ii) MOLECULE TYPE: synthetic DNA 

379 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

381 GGCCCTTAAC ACACTAGTGT CGCATTATAT TTAC 34 

383 (2) INFORMATION FOR SEQ ID NO: 18: 
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RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/09/258,600 



DATE : 10/28/2002 
TIME : 15:04:49 



Input Set : N:\Crf3\RULE60\09258600.raw 
Output Set: N:\CRF4\10282002\l258600.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq* : 6; N Pos . 29,3 0,32,33,3 5,36,38,39,41,42,44,45,47,48,50,51,53,54,56,57 
Seq*:6; N Pos. 5 9,60,62,63,65,66 

Seq#: 7 ; Xaa Pos . 7 , 8 , 9 , 1 0 , 1 1 , 1 2 , 1 3 , 1 4 , 1 5 , 1 6 , 1 7 , 1 8 , 1 9 

Seq* : 12 ; N Pos . 27,28,30,31,3 3,34,36,37,39,40,42,43,45,46,48,49,51,52,54,55 
Seq# : 12 ; N Pos . 57,58 

Seq#:13; Xaa Pos . 8 , 9 , 10 , 1 1 , 1 2 , 1 3 , 1 4 , 1 5 , 1 6 , 1 7 , 1 8 

Seq# : 2 7; N Pos . 12,13,15,16,18,19,21,22,24,25,27,28,30,31,33,34,36,37,39,40 
Seq#:27; N Pos. 4 2,43,45,46,48,49 

Seq# : 29; N Pos . 22, 23, 25, 26, 28, 29, 31, 3 2, 34, 35, 37, 38, 40, 41, 43,44, 46, 47, 49, 50 
Seq# : 29 ; N Pos . 52 , 53 

Seq# : 39; N Pos . 19, 20, 22, 23, 2 5, 2 6, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47 
Seq#:39; N Pos. 49,50,52,53 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/258,600 



DATE: 10/28/200. 
TIME: 15:04:49 



Input Set : N:\Crf3\RULE60\09258600.raw 
Output Set: N:\CRF4\10282002\I258600.raw 



33 
3 4 
134 
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210 
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2^8 
3 16 
3 1 9 
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3 61 
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2 0 C: 
20 C: 
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246 W 
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336 W 
3 36 W 
: 336 
: 336 
: 3 3 6 W 
: 336 
: 336 



Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:] 
Keyword misspelled or invalid format, [(B) FILING DATE:] 
Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE: 
Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE: 
(46) "n" or "Xaa" used, for SEQ ID# : 7 after pos . : 0 
(46) "n" or "Xaa" used, for SEQ ID# : 7 after pos.:I6 

[MOLECULE TYPE: 



: 336 
: 3 36 
: 3 3 6 
3 3 6 



Invalid value of Alpha Sequence Header Field, 
Invalid value of Alpha Sequence Header Field, 



[MOLECULE TYPE: 



(46) 
(46) 



or "Xaa" used, 
or "Xaa" used, 



for SEQ ID#:13 after pos . : 0 
for SEQ ID#:13 after pos.:17 
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Invalid Amino Acid Number in Coding Region, SEQ ID: 46 

Invalid Amino Acid Number in Coding Region, SEQ ID: 48 

Invalid Amino Acid Number in Coding Region, SEQ ID: 50 

Invalid Amino Acid Number in Coding Region, SEQ ID: 52 
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